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Abstract 

The main aim of the paper is to give a short self-contained proof of the decidability 
of language equivalence for deterministic pushdown automata, which is the famous 
problem solved by G. Senizergues, for which C. Stirling has derived a primitive recur- 
sive complexity upper bound. The proof here is given in the framework of first-order 
grammars, which seems to be particularly apt for the aim. 
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1 Introduction 

The decidability question for language equivalence of two deterministic pushdown automata 
(dpda) is a famous problem in language theory. The question was explicitly stated in the 
1960s [lj (when language inclusion was found undecidable) ; then a series of works solving 
various subcases followed, until the question was answered positively by Senizergues in 
1997 (a full version appeared in |2])- G. Senizergues was awarded Godel prize in 2002 for 
this significant achievement. Later Stirling [5], and also Senizergues [3], provided simpler 
proofs than the original long technical proof. A modified version, which showed a primitive 
recursive complexity upper bound, appeared as a conference paper by Stirling in 2002 [6]. 

Nevertheless, even the above mentioned simplified proofs are rather technical, and they 
seem not well understood in the (theoretical) computer science community. One reason 
might be that the frameworks like that of strict deterministic grammars, which were chosen 
by Senizergues and Stirling, are not ideal for presenting this topic to a broader audience. 

From some (older) works by Courcelle, Harrison and others we know that the dpda- 
framework and strict-deterministic grammar framework are equivalent to the framework 
of first-order schemes, or first-order grammars. In this paper, a proof in the framework of 
first-order grammars is presented. 

Author's remark. I have been reminded that J. R. Biichi in his book "Finite automata, 
their algebras and grammars: towards a theory of formal expressions" (1989) argues that 



using terms is the way proofs on context-free grammars should be done. I have not managed 
to verify myself, but this can be an indication that the framework of first-order terms might 
be "inherently more suitable" here. 

In fact, the proof here shows the decidability of trace equivalence (a variant of language 
equivalence, coinciding with bisimulation equivalence on deterministic labelled transition 
systems) for deterministic first-order grammars; the states (configurations) are first-order 
terms which can change by performing actions according to the root-rewriting rules. To 
make the paper self-contained, a reduction from the dpda language equivalence problem 
to the above trace equivalence problem is given in an appendix. 

In principle, the proof is lead in a similar manner as the proofs of Senizergues and 
Stirling, being based on the same abstract ideas. Nevertheless, the framework of the first- 
order terms seems to allow to highlight the basic ideas in a more clear way and to provide a 
shorter proof in the form of a sequence of relatively simple observations. Though the proof 
is the "same" as the previous ones on an abstract level, and each particular idea used here 
might be embedded somewhere in the previous proofs, it is by no means a "mechanical 
translation" of a proof (or proofs) from one framework to another. Another appendix 
then shows that the framework chosen here also has a potential to concisely comprise and 
slightly strengthen the previous knowledge of the complexity of the problem, though the 
proofs in that part are not so detailed as in the decidability part. 

Author's remarks. I hope that the presentation here should significantly extend the 
number of people in the community who will understand the problem which seems to 
belong to fundamental ones; this might also trigger new attempts regarding the research 
of complexity. Another remark concerns the previous version(s) of this arxiv-paper where 
I also claimed to provide a smooth generalization of the decidability proof to the case of 
bisimilarity for nondeterministic first-order grammars. Geraud Senizergues was present at 
my talk at http: //www.lsv.ens-cachan.fr/Events/Pavas/ (20 January, 2011) and he later 
put a counterexample on arxiv: |http://arxiv.org/abs/1101.5046| (My mistake was, in fact, 
embarrassingly simple, mixing the absolute equivalence levels with the eq-levels relative to 
fixed strategies. At the moment, I do not speculate how this can be corrected.) 

In the rest of this section, the main ideas are sketched. A GNF (Greibach Normal 
Form) grammar Q = (A/ - , A, 1Z), with finite sets M of nonterminals and A of terminals, has 
the rewriting rules X — > aY\ . . . Y n where a G A and X, Y\ , . . . , Y n G M . Such a rewrite 
rule can be written as Xx — > Y\. . . Y n x, for a formal variable x, and read as follows: any 
sequence Xa G A/"* (a 'state', or a 'configuration') can perform action a while changing into 
Y\ . . . Y n a ; this includes the case when n = and thus Y\ . . . Y n = e, the empty word. The 
language L(a) is the set of words w = aia 2 ■ ■ ■ a m G A* such that a — h .. — ^> — ^ e. 

In a first-order grammar Q = (A/", A, TV) , each nonterminal X has a finite ar- 
ity (not only arity 1 like in the GNF grammar), and the (root rewriting) rules are 
Xx\ . . . x m — > E(xi, . . . , x m ) where m = arity(X) and E is a finite term over M where 
all occurring variables are from the set {xi, . . . , x m }. When G\, . . . , G m are terms then 
XGi . . . G m — % E(Gi, . . . , G m ) where E(Gi, . . . , G m ) denotes the result of substitution 
E(xi, . . . , x m )[Gi/xi, . . . , G m /x m }. Grammar Q is deterministic if for each pair X G Af, 
a G A there is at most one rule of the type Xx\ . . . x m — .... We note that states (con- 
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figurations) are no longer strings (as in the case of GNF grammars) but terms, naturally 
represented as trees. 

It is a routine to reduce dpda language equivalence to deterministic first-order grammar 
trace equivalence: two terms T,T' are equivalent, T ~ T', iff the words (traces) w G A* 
enabled in T (T —t Ti for some Ti) are the same as those enabled in T' . It is natural to 
define the equivalence- level EqLv(T, T') as the maximal k G N for which we have T T', 
which means that T,T' enable the same words upto length k; we put EqLv(T, T') = u iff 
T ~ T, i.e. iff T T for all k G N. 

Given a deterministic first-order grammar £ and an initial pair of terms To, Uq, the first 
idea for deciding T ~ Uq is to use the (breadth-first) search for a shortest word w which 
is enabled in just one of T , Uq. We call such a word as offending (adopting the viewpoint 
of a defender of the claim T ~ C/ ). 

To get a terminating algorithm in the case of To ~ Uq, we can think of some sound 
system enabling to establish for certain words u G A* that they are not offending prefixes, 
i.e. prefixes of offending words for T , U ; e.g., if T — > T and Uq — > T then u cannot 
be an offending prefix. We aim at completeness, i.e., look for some means which enable to 
recognize sufficiently many nonoffending prefixes, finally showing that e is not offending, 
which means T ~ Uq. 

We use a simple observation that the eq-level of a pair (T, U) can drop by at most 

1 when both sides perform the same action, and it really drops by 1 in each step when 
we follow an offending word. It is also easy to observe a congruence property of subterm 
replacement; in particular, given terms U\,U 2 with EQLv(Lq, U2) = k and Ti,T 2 with 
EqLv(Ti,T 2 ) > k + 1 where U\ has a subterm Ti, i.e. U\ = E{T\), by replacing Ti with 
T 2 we get for the arising U[ = E{T 2 ) that EqLv(C/(, C/ 2 ) = EqLv([A, C/ 2 ) = k; the eq-level 
has been unaffected, and moreover, even the offending words for {U[, U 2 ) are the same as 
the offending words for (Lq, U 2 ). 

The above simple observations allow to build a (sound and) complete system, when we 
add the notion of a basis, a finite set of pairs of 'equivalent heads' (tree-tops, tree-prefixes) 
E(x\, . . . , x n ), F(x\, . . . , x n ), for which we have E(T\, . . . , T n ) ~ F(T\, . . . , T n ) for every 
instance. To enable a smooth completeness proof, showing even the existence of a fixed 
sufficient basis B for each grammar Q (not depending on the initial pair), it is helpful to 
start immediately in a more general setting of regular terms, which are finite or infinite 
terms with only finitely many subterms (where a subterm can possibly have an infinite 
number of occurrences); such terms have natural finite graph presentations. 

The structure of the paper is clear from (sub)section titles. There are also two (above 
mentioned) appendices. 

2 Basic definitions and simple facts 

In this section we introduce the basic definitions and observe some simple facts on which 
the main proof is based. 

By N we denote the set {0, 1,2,.. .} of natural numbers; symbol oj is taken as the first 
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infinite ordinal number, which satisfies n < co and u — n = u for any n G N. 
For a set A, by A* we denote the set of finite sequences, i.e. words, of elements of A; the 
length of w G A* is denoted |tu|, and we use e for the empty sequence, so \e\ = 0. By A u 
we denote the set of infinite sequences of elements of A (i.e., the mappings N — > A). 
Given L C A* U A w and u G .4*, by w\L we mean the left quotient of L by w, i.e. the set 
{v e A* U \ uv e L}. By PREF(L) — {u e A* \ uv e L for some t> } we denote the set 
of (finite) prefixes of the words in L. 

(Deterministic) labelled transition systems and (stratified) trace equivalence 

A labelled transition system (LTS) is a tuple (S,A, (—^) a eA) where S is the set of states, 
A the set of actions and — ?>C S x S is the set of transitions labelled with a. 
We extend to relations —^-C. S x S for all w E A* inductively: s — ^> s; if s -^4 si 
and si — S2 then s s 2 - We say that s' is reachable from s by a word w if s —t s'. 
A state s G S enables (a trace) w G A*, denoted s if there is s' such that s s'. 
An infinite sequence a = aia 2 a 3 . . . G .4^ is enabled in Sq, written s if there are 

s i> s 2, s 3 ? • • • such that Sj ^4 s i+ i for all i G N. The trace-set of a state s G 5 is defined as 

trac(s) = {w e A* \ s -^}. 

For G N, we define TRAC- fe (s) = {w G .4* | w G TRAC(s) and |u>| < A;}; we also put 
trac%s) = {a G *4 W | s — >}. On the set S, we define the (trace) equivalence ~, and the 
family of equivalences A; G N, as follows: 

r ~ s TRAC(r) = TRAC(s) and r ~ fe s <^df TRAC- fc (r) = TRAC- fe (s). 
Observation 1 ~ = S x S. \/k G N :~fe2~fe+i- ~fc=~- 
The equivalence level, or the eq-level, of a pair of states is defined as follows: 
EQLv(r, s) = k if r ~ fc s and r 7^+1 s ; EQLv(r, s) = 00 if r ~ s. 

The shortest words showing nonequivalence for a pair r, s (if r 7^ s) are called offending 
words: OW(r, s) = {w \ w is a shortest word in (trac(t) \ TRAC(s)) U (trac(s) \ 
TRAc(r))}. The elements of PREF(OW(r, s)) are called offending prefixes for the pair 
(r, s). We now note some trivial facts, point 3 being of particular interest: 

Observation 2 (1.) r ^ s iff OW(r, s) = iff e £ PREF(OW(r, s)). 
f£j If r rfj s then EQLv(r, s) = \w\ — 1 /or any w G OW(r, s). 
// EQLv(r, s) = k and EQLv(r, g) > fc + 1 £/ien 
EQLv(g, s) — k and OW(r, s) = OW(g, s). 

An LT^ (S,A, (— — >)ae^) is deterministic if each — is a partial function, i.e.: if r — ^ si 
and r — % S2 then si = S2. (Recall now the left quotient operation u\L.) 
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Observation 3 In any deterministic LTS, if r — > r' then TRAc(r') = w\TRAC(r) ; and 
TRAC w (r') = w\TRAC w (r). Moreover, TRAC(r) = TRAC(s) implies TRAC w (r) = TRAC w (s). 

We use notation (r, s) — > (r 7 , s') as a shorthand meaning r — — >■ r' and s — — >■ s'. 

Proposition 4 Assume a deterministic iTS", and suppose (r,s) — > (r',s'). Then: 

1. EQLv(r, s) — \u\ < EQLv(r', s') particular, r ~ s implies r' ~ s'j/ 

2. ijr rf, s then EQLv(r, s) - |«| = EqLv(t', s') ij(f u G PREF(OW(r, s)); 
5. if u E PREF(OW(r,s)) then OW(r',s') = u\OW(r,s). 

Proof: This follows almost trivially from Observation [31 E.g., for Point 3. it is sufficient 
to note: if u G PREF(OW(r, s)) then G OW(r, s) iff u G OW(r', s'). □ 

Finite and infinite regular terms and their finite graph presentations 

We now give (a variant of) standard definitions of first-order terms, including infinite terms; 
we fix a countable set V = {xi, x%, X3, . . .} of (first-order) variables. 

Let us now assume a given finite set M of ranked symbols, called nonterminals (though 
we can also view them as function symbols). Each X G H thus has arity(X) G N; we use 
X, Y to range over elements of N '. 

A (general) term E over M (and V) is defined as a partial mapping E : N* — > M U V 
where the domain uom{E) C N* is prefix-closed, i.e. dom(£) = pref(dom(E)), and 
nonempty (e G DOM(_E)); moreover, for 7 G DOM(_E) we have 72 G DOM(£) iff 1 < % < 
arity(E '(7)) where the arity of variables Xj G V is viewed as 0. 

For each 7 G dom(£), by -E 1 ^] we denote the subterm occurring at 7 in where 
-E'[ 7 ](5) = E(j6) for each 5 G DOM(E[ 7 j) = 7\dom(£ i ); this occurrence of subterm E[ y ] has 
depth I7I in E 1 . 

For X e Af and terms Gi, G2, • • • , G m , where m = arity (X), by XG1G2 • • • G m we 
denote the term E for which -E(e) = X and = Gi for each length-1 sequence i where 
1 < i < m; X is the root nonterminal of this term i£. Each variable G V is also viewed 
as the term E for which J5(e) = Xj (and thus dom(£) = {e}). 

A term is finite (infinite) if dom(£) is finite (infinite). The depth-size of a finite 
term £7, denoted DEPTH(i^), is the maximal I7I for 7 G dom(£) (i.e., the maximal depth 
of a subterm-occurrence in E). A term is regular if the set of its subterms is finite (though 
the subterms can have infinitely many occurrences). 

By Terms^/" we denote the set of all (finite and infinite) regular terms, since we will 
not consider nonregular terms anymore. Hence from now on, when saying "term" we 
mean "regular term". GTerms^v" denotes the set of all (regular) ground terms, i.e. the 
terms in which no variables Xi occur. We use symbols T, U, V, W (possibly with sub- and 
superscripts) for ranging over GTermSaa; symbols E, F, G, H are used more generally, 
they range over Terms_v. 

Regular terms can be infinite but they have natural finite presentations, since they can 
be viewed as the unfoldings of finite graphs: 
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Definition 5 A graph presentation of a regular term is a finite labelled (multi) graph g, 
where each node v has a label X(v) G AfU V and m outgoing edges labelled with 1,2, ... ,m 
where m = arity(X(v)) (and where different edges can have the same target); moreover, 
one node is selected as the root. Graph g represents term T g as follows: dom(7^) consists 
of sequences of edge-labels of finite paths in g starting in the root; 7^(t) = A(V) where v' 
is the end-vertex of the path with the edge-label sequence 7. 

(Given A/",) we can naturally define a notion of the size of a graph presentation g, e.g., 
as the number of nodes of g, or, to be pedantic, as the length of a standard bit-string 
representation of g (thus also handling the descriptions of indexes of variables Xi). We define 
the presentation size of a term F, denoted PresSize(F), as the size of the least graph 
presentation of F. By PresSize(_E, F) for a pair E, F we mean the sum PresSize(E) + 
PresSize(F), say. On our level of reasoning, we do not need further technical details, 
since the facts like the following one are sufficient for us. 

Observation 6 

For any sGN, there are only finitely many pairs (E, F) with PresSize(_E, F) < s. 

We will also use some facts concerning an effective (algorithmic) work with finite presen- 
tations of regular terms. The next observation is an example of such a fact. Further we 
often leave such facts implicit. 

Observation 7 There is an algorithm which, given (Af and) graph presentations g\, g 2 , 
decides ifT(gi) = T(g 2 )- 

Substitutions, ground instances of a pair (E,F), a limit substitution 

By a substitution we mean a mapping a : V — > TERMS^-. The term Ea arises from E 
by replacing each occurrence x^ in E with o~(xi). (Since E and o(xi) are regular, Ea is 
regular.) By writing o = [G^/x^, . . . , G in /x in ] we mean that cr(xj) = G { if i G {h, . . . , i n } 
and cr(xi) = Xi otherwise. Substitutions can be naturally composed; associativity allows 
to omit the parentheses: Eaters = ((Eai) 0-2)02, = E(o± (02O3)) = (Eoi)(a 2 o 3 ), etc. 
F is an instance of E if there is a substitution o such that Eo = F. 

As usual, we sometimes write E(xi 1 , . . . , Xi n ) to denote the fact that all variables occur- 
ring in E are from the set {x^ ,...,#$„}. In fact, we only use the special case E(x±, . . . , x n ); 
note that even a ground term E can be viewed as E(x±, . . . , x n ), for any n. (We ignore the 
slight notational collision with the previous use E(^y), since this should cause no problems.) 

Convention. When writing F(G\, . . . , G n ), we implicitly assume F = F(x±, . . . , x n ), and 
we take F(G 1 , . . . , G n ) as a shorthand for F[Gi/xi, . . . , G n /x n ]. In particular we note that 
F(Gx(H 1: . . . , H m ), . . . , G n (H 1: . . . , H m )) = (F(G 1 , . . . , G n ))(H u . . . , H m ). 

A ground term U which is an instance of E is called a ground instance of E. We will in 
particular use the notion of a ground instance of a pair (E(x 1 , . . . , x n ), F(x±, . . . , x n )): it is 
any pair (Eo, Fo) where o is a substitution [Ui/xi, . . . , U n /x n ] where C/j are ground terms. 
We usually write (E(U U ...,U n ), F(U U U n )) instead of {Ea, Fo). 
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By writing Ea 1 we mean Ea; Ea k+1 (k G N) means Ea h a. We need just a special 
case of substitutions a for which Ea^ is well-defined; we use the graph presentations for 
the definition (which also shows another aspect of the effective work with these finite 
presentations). 

Definition 8 Given (a regular term) H we define H[H/xi\ u , denoted as H hnii , as follows: 
given a graph presentation g where T{g) = H, then H hnit = T(g') where g' arises from g 
by redirecting each edge leading to a node labelled with Xi (in g) to the root (in g'). (Hence 
if H = Xi or Xi does not occur in H then H hmi — H .) 

Head-tails presentations of terms, the d-prefix form of terms 

If F = E(G\, . . . ,G n ) then we say that the (regular) head E and the (regular) tails 
Gi,...,G n constitute a head-tails presentation of F. We can also note that the head 
E itself can be presented by a head-tails presentation E = G(Fi, . . . , F m ), say, etc. 
A particular head-tails presentation of a term is its <i-prefix form; it suffices when we 
restrict ourselves to ground terms: 

Definition 9 For a ground term V and d G N, the <i-prefix form of V arises as follows: 
we take all (ordered) occurrences of subterms of V with depth d (if any), say T\, . . . ,T n , 
and replace them with variables X\, . . . ,x n , respectively. We thus get a finite term Pj = 
Pj (x\, . . . , x n ), the d-prefix of V. The head Pj and the tails T 1; . . . , T n constitute the d- 
prefix form ofV = Pj(T 1} . . . , T n ). (Pj = V when V is a finite term with Depth(V) < d.) 

Observation 10 If m is the maximal arity of nonterminals in Af then the number n of 
tails in the d-prefix form is bounded by m d . 

We also note the next obvious fact. 

Observation 11 If F = F{x\) ^ X\ then F hmi is a ground term, and for any T we have 
pf l mi = pf^ T ) where a = [F/ Xl \. 

First-order grammars as generators of LTSs 

Definition 12 A first-order grammar is a structure Q = (J\f,A,lZ) where M is a finite 
set of ranked nonterminals, A is a finite set of actions (or terminals), and 1Z a finite set 
of (root rewriting) rules of the form 

XX!X 2 . . .x m E(xi,x 2 , . . . ,x m ) (1) 

where X G N ' , arity(X) = m, X\, x 2 , ■ ■ ■ , x m G V , a G A, and E is a finite term over M 
(and V) in which all occurring variables are from the set {x\ ) x 2l ■ ■ ■ ,x m }. (E = Xi is a 
particular example.) Grammar Q is deterministic, a det-first-order grammar, if for each 
pair X G M , a G A there is at most one rule of the form (Qp. 
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Remark. Context-free grammars in Greibach normal form can be seen as a special case, 
where each nonterminal has arity 1. Classical rules like A — > aBC, B — )■ b can be presented 
as Ax i — % BCxi, Bx\ X\. 

We view Q = (J\f,A,TZ) as a generator of the LTSg = (TermSa/-, „4, (-^)ae^) 

where each (root) rewriting rule Xx\X2 ■ ■ ■ x m — E(x\,X2, . . . , x m ) is a "schema" (a "tem- 
plate") of a set of transitions: for every substitution a = [Gi/xi, . . . , G m /x m ] (including a 
with a(xi) = Xi for all i) we have (Xxi . . . x m )a — > E(xi, . . . , x m )a, i.e. 

XG1G2 ■ ■ ■ G m — > E(Gi, G2, . . . , G m ). 

Observation 13 When Q is deterministic then LTSg is deterministic. 

Though the main result applies to deterministic grammars, we will also note some proper- 
ties holding in the general (nondeterministic) case. 

The notions and notation like F — > (w is enabled by F), F — > F' (for words w G A*), 
F (for a E A u ), trac(F), TRAC- fc (F), trac"(F), trace equivalence E ~ F, etc., 
are inherited from LTSg. (Note that the term Xi enables no actions; nevertheless, it is 
technically convenient to have also nonground terms as states in LTSg.) 
We will be interested in comparing ground terms, so we will use T ~ k JJ, T ~ U, 
EqLv(T,[/), OW(T,U), pref(OW(T,C/)) mainly for T,U G GTerms^. We note (now 
explicitly) another fact about the effective work with graph presentations: 

Observation 14 Given (Q and) a graph presentation g (of term T{g)), we can effectively 
find all rewriting rules in 1Z (of type (Tjpj which can be applied to T(g) and for each such 
rule yielding T{g) — ^ F we can effectively construct some g' such that T(g') = F. 

Root-locality of action performing, words exposing subterm occurrences 

Assuming a given first-order grammar Q = (Af, A, TV) , we observe some consequences of the 
fact that the root nonterminal of XG\ . . . G m determines if a G A is enabled, and when a 
transition is performed then the subterms G\, . . . , G m play no role other than being copied 
(or "lost") appropriately. 

Observation 15 For a G A and u G A* , we have au G TRAC(XG! . . . G m ) iff there is 
a rule Xx\ . . .x m — > E{x\, . . . ,x m ) in TZ such that u G trac(-E) or u = U\U2 where 
E — ^ xi and U2 G TRAc(Gj) (for some i, 1 < i < m). 

We say that w G A* exposes the i-th successor of X G M if Xx\ . . . x m — > X{. 

Observation 16 Viewing a (regular) term E as a partial mapping E : N* — > M U V, we 
note that there is u such that E — > x^ iff there is 7 G dom(E) where E(j) = Xi and for 
each prefix Sj of ^ there is some w exposing the j-th successor of E(5). 
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Observation 17 trac(E(Gi, . . . ,G n )) = 

trac(£'(xi, . . . , x n )) U U Kl <„{™ | E Xj and u G TRAc(Gj)}. 

We now look at some simple algorithmic consequences of the above observations. 

Proposition 18 There is an algorithm which, given Q = (Af, A, TV), computes (and fixes) 
a word w(X,i) for each pair (X,i), where X G Af, 1 < i < arity(X), so that w(X,i) is 
a shortest word exposing the i-th successor of X if any such word exists and w(X,i) = e 
otherwise. 

Proof: Recalling Observations [T5l and [T6| a brute-force systematic search of all w(X,i) 
is sufficient: we finish when finding that the remaining pairs (X, i), i.e. those for which 
w(X, i) have not been computed, are mutually dependent, i.e., the existence of an exposing 
w(X, i) for any of the remaining pairs depends on the existence of exposing words for some 
remaining pairs. □ 

For later use we note that we can compute a bound bigger than any \w(X,i)\, say 

M = 1 + max{ \w(X, i)\ | X G Af, 1 < i < arity(X) } . (2) 

We also note the bounded increase of the depth-size of finite terms, given by the fact that 
the right-hand sides of the (finitely many) rules ([1]) in 1Z are finite terms. 

Observation 19 (ForQ,) there is a (linear) nondecreasing function B-Inc : N — > N such 
that: if F F' for a finite term F then Depth(F') < Depth(F) + B-Inc(|w|). 

Generally we cannot provide a similar lower bound for Depth(F'), since some X{ might 
not occur in E in But we can recall the d-prefix form from Definition [9] (defined for 
ground terms) and note the following obvious fact. 

Observation 20 If V = Pj(T x , . . . , T n ) and \u\ < d then V V iff there is some 
(finite) E such that Pj (x\, . . . , x n ) — — >■ E(x\, . . . ,x n ) and V = E(T\, . . . ,T n ) (where 
Depth(E) < d + B-Inc(M)/ Hence if = Pj then U ~ d V. 

Congruence property of ~fc and ~ 

Proposition 21 IfT ~ fc V then E(T) ~ fc E{T'). (T ~ V implies E{T) ~ E{T').) 

Proof: The claim follows from Observation [T7J trac(E(T)) consists of traces w G 
trac(-E'(xi)) and of traces of the form w = uv where E(x\) — > x\ and v G trac(T). □ 

Proposition 22 IfT ~ fe F(T) where F ^ x t then T ~ fc F lim \ 
Hence T ~ F(T) implies T ~ F Hmi . 
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Proof: If T ~fe F(T) then by repeated use of Proposition [2T1 we get T ~ fc Fa k (T) where 
a = [F/x x ]. By Observation QT] we get P^ k( - T) = Pf'™ 1 and thus Fa k (T) ~ fc F Zimi (by 
Observation 120]) . which implies T ~ fc F hmi . □ 

Corollary 23 // Tj #(71, . . . ,T n ) where H ^ x { then T { ~ fc #" mi (Ti, . . . ,T n ) (with 
no occurrence of Xi in H hmi ). In particular, if T n ~& H(Ti, . . . , T n ) where H ^ x n then 
T n ~fc H limn (Ti, . . ., T n _i). 

A normal form for first-order grammars 
Observation 24 

// there is no u such that E{x\) — > x\ then E(T) ~ E(T') for any T, T' . 

Definition 25 Q = (Af,A,lZ) is in normal form if for each X G J\f and i,l < i < 

arity(X) there is (a shortest) w(X,i) exposing the i-th successor of X . 

Proposition 26 Any Q = (A/", A, 1Z) can be effectively transformed to Q' = (A/"', A, TV) in 
normal form, yielding also an effective mapping trans : Terms^ — > Terms^/ 7 such that 
T ~ TRANS(T) (in the disjoint union of LTSg and LTSgi). 

Proof: For each Y G Af, by ESy = (ii, i%, ■ ■ ■ , i\ Y ) (Exposable Successors of Y) we 
denote the subsequence of (1, 2, ... , arityiY)) where i G ESy iff there is w exposing the 
z-th successor of Y; by Y' we denote a fresh nonterminal with arity(Y') = ky ■ We put 
trans (x,) = Xj and trans (Yd ... G m ) = Y' trans (G iY ) ... trans (G { y ). Each rule 

1 fey 

Xx\ . . . x m — E(xi, . . . , x m ) in 1Z is transformed to the rule X'x\ . . . Xk x — — >■ trans(E)o~ 
where a = [xi/x^, . . . , Xk x /x { x ]; thus 1Z' arises. This guarantees that Yx\ . . . x m — — >■ 

iff Y'x\ . . . Xk Y — > Xj. The claim now follows from Proposition [18] and Observation 1241 □ 



Exposing equations for pairs (E, F) in deterministic LTSg 
We now note an important notion and make some observations. 

Definition 27 We say that u G A* exposes an equation for the pair 

(E(xt, . . .,x n ),F(xi, . . .,x n )) ifE(xt, ...,x n ) x i; F(x x , ...,x n ) H{x u . . .,x n ), or 

vice versa, where H(xi, . . . , x n ) ^ X{ (but might be H = Xj for i ^ j). 

Formally we write this exposed equation as Xi = H(xi, . . . ,x n ). 

Proposition 28 For deterministic Q, if E(T[,...T^) ~^ F(T{, . . . ,T^) and 
E(T 1: ...T n ) rf k F(Ti, ...,T n ) then an offending prefix for (E(Tx, . . . , T n ), F(T U T n )) 
exposes an equation for E,F. 
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Proof: There is surely no u e PREF(OW(S(2i, . . . , T n ), F(7\, . . . , T n ))) such that 
(E,F) {x h Xi). If there is e OW (E(T U . . . , T n ), F(T U . . . , T n )) (where \w\ < k) 
such that (E, F) — y (E', F') where none of E', F' is a variable and E' rf\ F' then wa 
witnesses that E{T[, . . . , T,Q 7^ -^(^i, . . . , T^) - a contradiction. □ 

We recall that Observation [3] and Proposition H] apply to LTSg when is deterministic, 
and observe the following: 

Observation 29 For deterministic Q, if u exposes an equation Xi = H(x\, . . . ,x n ) for 
E,F then EqLv(E(T 1 , T n ), F(Ti, T n )) — \u\ < EqLv^-, H{T x , . . . , T n )) (for any 
T u . . . , T n ). Thus if E(T U . . . , T n ) ~ F(T U . . . , T n ) then T t ~ H lim > (T u T n )); otherwise 
(at least) T t ~ fc H l ^{T u . . . ,T n )) for k = EqLv (E(T U . . .,T n ),F(T u . . . ,T n )) - \u\. 

3 A "word-labelling predicate" |= and its soundness 

Definition 30 (Given Q ,) a pair (E(xx, . . . ,x n ), F(xi, . . . ,x n )) is sound if we have 
Eo ~ Fa for all ground instances (Ea, Fa) (i.e., (E(Ti, . . . , T n ), F(Ti, . . . , T n )) ) of the 
pair (E, F). A set B of pairs (E, F) is sound if each element is sound. 
By GlNST(i3) we mean the set of all ground instances of pairs in B. 

Let us now assume a given deterministic first-order grammar Q = (Af, A, 1Z) in normal 
form, and a finite set B, called a basis, of pairs (of regular terms) (E,F), supposedly 
sound; we always assume B contains the pair (xx,Xi) which is obviously sound. 

The derivation (or deduction) system in Figure assuming Q and B, provides an 
inductive definition of the predicate 

\=(t ,u ) C A* x ((GTerms^ x GTerms^) U {NOP, FAIL}). 

So it is in fact a family of predicates, parametrized with the initial pair (T , Uq) of regular 
ground terms. We write just \= instead of \=(t ,u ) when (T , Uq) is clear from context. 

We can see that Axiom, Basic transition rule (1.) and Rejection rule (6.) guarantee 
that if T ^ JJq and ua is an offending word for (T , Uq), which implies (T , Uq) — > (T, U) 
where T 7^ U, then u |= (T, U), which can be read as "w can be labelled with the pair 
(T, £/)", and thus e \= FAIL; as expected, e |= FAIL is intended to mean T 7^ Uq. 

In the case T ~ Uq we are guaranteed that if (T , Uq) — > (T, U) then u \= (T, U) but 
it is not clear how to use this to conclude that Tq ~ Uq. To this aim we introduce another 
"label": u |= NOP (read u u can be (also) labelled with NOP") is intended to imply that 
u is Not an Offending Prefix for {Tq, Uq) if B is sound. Thus e \= NOP is intended to 
imply Tq ~ Uq if B is sound. Having this in mind, Basis rule (4.) is clear; we will later 
realize the reason for the condition u 7^ e. Bottom-up progression rule (5.) is clear as well; 
we note in particular that it enables to derive u |= NOP when u |= (T, U) and T, U do not 
enable any action. 

The most interesting is the Limit subterm replacement rule (2.), with its particular 
case of Subterm replacement rule. It allows to label u also with other pairs than those 
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• (Axiom) e \= (T ,U ) 

• (Primary derivation (deduction) rules, determining when u \= (T,U)) 

1. (Basic transition) 

If u |= (T, U), T ~x U, (T, £/) -^4 (T' 3 U') then ua |= (T', U'). 

2. (Limit subterm replacement) 

If u |= (E(T), U), \v\ < \u\, v \= (T, F(T)), F ^ x x then u |= (£(F' imi ), 17). 
(A particular case is Subterm replacement: if v |= (T, T") then u |= (E(T r ), U).) 

3. (Symmetry) If w |= (T, [/) then it |= (U,T). 

• (Secondary derivation rules, determining when u |= NOP and/or u \= FAIL ) 

4. (Basis) Uu^e,u\= (T, C/) and (T, [/) e GlNST(i3) then u \= NOP. 

5. (Bottom-up progression) If u |= (T,U), T ~i [/, and wa |= NOP for all a 
enabled by T (and U) then u |= NOP. 

6. (Rejection) If u |= (T, £/) where T fa U then e (= FAIL. 

Figure 1: Inductive definition of \=(t ,u ) (for given Q, B) 

derived just by Basic transition; so one u can get many pairs of terms as "labels". This is 
meant to help to create instances of the basis and label the respective words with NOP. 
The condition |i>| < \u\ is important for soundness of the predicate |= (wrt its intended 
meaning). The symmetry rule (3.) could be dropped if we included all symmetric cases in 
the Limit subterm replacement rule. 

As usual, we write u |= (T, U), or u \£ (T, U), if the predicate is true (i.e., derivable) for 
the triple u,T, U, or not true (not derivable), respectively; similarly for NOP and FAIL. 

We now recall Observation [3] and Proposition HJ and show the following generalization 
in our case of det-first-order grammars. This is the crucial point for showing soundness 
(Proposition [33]). (In fact, Point 3. is used later in the completeness proof.) 

Proposition 31 Given (a det-first-order grammar Q and) an initial pair (T q ,Uq): 

1. Ifu\= (T, U) then EqLv(T , U ) - |u| < EqLv(T, U). (T ~ U implies T ~ U .) 

2. If T ^ U and u (= (T, U) where u E pref(OW(T , U )) then 

EqLv(T , Uo) - \u\ = EqLv(T, U) and OW(T, U) = m\OW(T , U ). 

3. If T ~ Uo and u |= (T, U) then trac(T) = trac(£7) = m\trac(T ) = w\trac(£/o) ■ 

Proof: We proceed by induction on the length of derivations. The axiom e \= (To, Uq) 
trivially satisfies the conditions. Suppose that the conditions are satisfied for all u \= (T, U) 
derived by derivations upto length m, and consider a derivation deriving u' |= (T', U') by 
m + 1 applications of the derivation rules. 



12 



If the last rule was 1. (Basic transition), using u |= (T,U) and (T,U) — (T',U r ), 
then ua \= (X", [/') satisfies the conditions by (the induction hypothesis and) Proposi- 
tions El H We have EqLv(T, U)-l< EqLv(T", U'). If ua G pref(OW(T , t/ )) then u G 
pref(OW(T ,C/ )), hence OW{T,U) = u\OW(T ,U Q ) and thus a G pref(OW(T, [/)); 
this implies OW(T", U') = a\OW(T, E7) and thus OW(f, C/') = (uo)\OW(T 0) U ). 

If the last rule was Limit subterm replacement (2.), so from u \= (E(T),U), \v\ < \u\, 
v |= (T,F(T)), F 7^ x\ we have derived u \= (E(F bmi ),U), then the conditions 1. and 3. 
(for u |= (E(F hmi ), U)) follow from (the induction hypothesis and) Propositions 1221 and l2Tj 
the condition 2. follows from Point 3. of Observation [2j 

Rule 3. (Symmetry) obviously preserves the conditions. □ 

Corollary 32 T ^ U iff e (= FAIL. 
Proposition 33 (Soundness) 

Ife (= (T)t0 NOP /or a/Z (T, [/) e {(T , C/ )} U GInst(-B) i/ien B is sound and T ~ C/ . 
Proof: By contradiction. 

Suppose the assumption holds but there is some (Tq,Uq) G {(T ,Uo)} U GInst(S) with 
the least finite eq-level. Take the longest (offending) prefix u of some w G OW(Tq, U' ) 
such that u \=(ti v u' ) NOP. The rule deriving u \=(t^u' ) NOP could not be the Basis 
rule since this supposes u ^ e and u \=(r^,u{,) (T, U) where (T, U) G GInstS but Point 
2. of Proposition I3T1 implies that EqLv(T, U) is smaller than the eq-level of any pair in 
GInst(B). The same Point 2. also implies that the deriving rule could not be Bottom-up 
progression since this presupposes ua \=rr',u') NOP for a longer prefix ua of the above 

w g owe/;/;). - ■ - ^ 

So we have a sound system, on condition B is sound. But we note that an algorithm surely 
cannot process infinitely many pairs (T, U) G GlNST(i3) (to show e \=(t,u) NOP for each 
of them). Fortunately, it suffices to consider a "critical instance" for each pair (E, F) in B 
which has the least eq-level among the ground instances of (E, F): 

Definition 34 The critical instance of a pair (E(xi, . . . ,x n ), F(xi, . . . ,x n )) is the pair 
(Ea,Fa) where a = [Li/xi, . . . , L n /x n ] for fresh nullary nonterminals Li (extending Q) 
such that Li fa V for any V ^ L t ; e.g., Li gets its own special action ^ and a rule 

Li — *-} Li. By CRlTlNST(i3) we mean the set of the critical instances of the pairs in B. 

Proposition 35 For any E(xi, . . . , x n ), F(xi, . . . , x n ), and any T 1; . . . , T n , 
EqLv(E(L u ...,L n ), F{L X , . . . , L n )) < EqLv(E(T 1} . . . , T n ), F{T X , . . . , T n )) 
if Li, . . . , L n do not occur in E,F. 

Proof: Proposition |2"51 shows that if we had k x = EqLv(E(L 1 , . . . , L n ), F(L ly . . . , L n )) > 
k 2 = EqLv(£/(T 1 , . . . , T n ), F(Ti, . . . , T n )) then there were an offending prefix u for 
(E(Ti, . . . , T n ), F(Ti, . . . , T n )) exposing an equation X{ = H{x\, . . . , x n ) for E, F, which 
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also means k 2 > \u\; but then (E(Li, . . . , L n ), F(L\, . . . , L n )) — > (Li, H(Li, . . . , L n )) 
where Lj 7^1 H(L\, . . . , L n ), and thus k\ < |it| (a contradiction). □ 

The next lemma summarizes some important ingredients for the decidability of trace equiv- 
alence for det-first-order grammars, showing that it is now sufficient to prove the complete- 
ness of |=. 

Lemma 36 For a det-first-order grammar Q and a finite set B of pairs of terms, if 

£ [= (Tjl0 NOP for all (T, U) G {(T , U )} U CritInst(£) (3) 
then B is sound and To ~ Uq. Moreover, the condition (jHJ) is semidecidable. 

4 Completeness of the predicate |= 

Definition 37 Given a det-first-order grammar Q , a finite set B of pairs of regular terms 
is a sufficient basis if B is sound and we have: if Tq,Uq are regular ground terms where 
T ~ U then e ^ (To ,u ) NOP (wrt Q, B). 

We now aim to prove that any det-first-order grammar (in normal form) has a sufficient 
basis. We use implicitly the fact that if T ~ Uq then u \={t ,u ) (T, U) implies T ~ U, 

TRAC(T) = TRAC(Z7) = u\tRAC(T ) = m\tRAC(C/ ), and TRAC^(T) = TRAC U, ([/) = 

u\trac w (T ) = m\trac w (?7o) (recall Propositions [3H and EJ). We start with a simple 
observation: 

Proposition 38 Given a det-first-order grammar Q and a sound basis B, if for every triple 
T ,Uo,a where T ~ Uo and a G trac^(To) (— TKAC^iUo)) there is a prefix u of a such 
that u \=(t ,u ) NOP then B is a sufficient basis. 

Proof: Suppose To ~ Uq. For every maximal w G TRAC(To) (for which there is no wa G 
trac(Tq)) we have w \=(t ,u ) NOP by using Basic transition and Bottom-up progression. 
Thus the assumption that each a G trac^(T ) has a prefix u for which u \={t ,u ) NOP 
implies that e \=(t ,u ) NOP, by repeated use of Bottom- up progression. □ 

We now show a sufficient condition for the existence of a sufficient basis (Proposition S2D, 
first introducing some auxiliary notions to this aim. 

Definition 39 We assume a det-first-order grammar Q . Given (T , Uq) where T ~ Uq, an 
infinite trace a G TRAC W (T ) is s-bounded (for s G N) if it has a nonempty prefix u such 
that u \=(T ,u ) (E(T U • • • , T„), F(T U T n )) where the pair {E(x 1 , . . . , x n ), F{x u x n )) 
is sound and PresSize(TJ, F) < s. 

Grammar Q is s-bounded if for each T ~ Uq all a G trac w (T ) are s-bounded. 
Grammar Q has a stair-base of width n G N if there is a function g : N — > N such 
that: for every T ~ Uq and every a G TRAC tJ (T ) either a is g(0)-bounded or there 
are some (unspecified) terms T 1; . . . ,T n and infinitely many nonempty, increasing prefixes 
w , Wi,w 2 , ... of a such that Wj \=(t ,u q ) {Fj{Ti, T n ), Fj(T u T n )) for some (regular) 
Ej, Fj with PresSize(Tvj, Fj) < g(j), for all j = 0, 1, 2, . . .. 
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For illustration and later use, we first note a particular example of s-bounded traces where 
s = PresSize(xi, xi); then we show that the stair-base property is indeed sufficient. 

Definition 40 Given Q and an initial pair (To, Uq), w £ A* UA^ has a repeat if there are 
two different prefixes u\,Ui, \u\\ < \v,2\, of w and some T,U such that u\ \= (T,U), U2 |= 
(T,U). (By Subterm replacement we then derive u<i |= (U,U), where (U,U) = {x\cr,X\cr) 
for a = [U/xi]; so if (xi,Xi) G B then u 2 \= NOP.J 

Proposition 41 A det- first- order grammar Q has a stair-base of width iff Q is s-bounded 
for some seN. If Q is s-bounded then it has a sufficient basis. 

Proof: The first part follows directly from the definitions (if the width is then Q is 
g(O)-bounded). For the second part it suffices to define B as the set of all sound pairs 
(E, F) with PresSize(£', F) < s (recalling Proposition EH]). □ 

Proposition 42 // a det- first- order grammar Q has a stair-base (of some width) then it 
is s-bounded (for some s) and thus has a sufficient basis. 

Proof: Assume a fixed Q which has a stair-base of width n > 0, for a fixed function g. 
By Proposition HU we are done once we show that Q has also a stair-base of width n—1. 

So let us consider an arbitrary pair To ~ Uq and some a G TRAC^(To) which is not 
g(0)-bounded (for Tq,Uq); there are prefixes wq, W\, w^, ■ ■ ■ of a such that Wj \=(t ,u ) 
{EjiTi, . . . , T n ), Fj(T x , . . . , T n )) where PresSize(^, F 3 ) < g{j). (E , F ) is not sound 
(since a is not g(O)-bounded) but E (Ti, . . . , T n ) ~ F (Ti, . . . , T n ) (Proposition [311 Point 
1.). There is thus a shortest v exposing an equation for (E ,F ) (recall Proposition l28l); 
w.l.o.g. we can assume that the equation is x n = H(x\, . . . , x n ) (where H ^ x n ), and 
thus wov \=(T ,u ) {I'm H(T X , . . . , T n )). Since there are only finitely many pairs (E, F) with 
PresSize(Tv, F) < g(0), there is some s G N determined by Q and g(0) (independent of 
T ,U ,a) such that \v\ < s and PREsSlZE(i7) < s. 

By Limit subterm replacement we get w'j \= (Tvj(T 1; . . . , T n _i), Tj(T 1; . . . , T n _i)) for 
j = 0,1,2,..., where w'j = w s+j} E'j = E s+j [H Hm " /x n ] and Fj = F s+j [H lim "/x n ]. So 
defining g'(j) = g(s + j) + f(s), for some trivial function / (e.g. f(s) = 2s, depending on 
the definition of the size of graph presentations), shows that Q has a stair-base of width 
n—1 (since our reasoning was independent of T , Uq, a). □ 

Now we aim to show that any det-first-order Q = (A/ - , A, 1Z) (in normal form) has a stair- 
base (of some width). We use further auxiliary notions, recalling w(X,i), M , B-Inc, and 
the (i-prefix form (d = M in our case). 

Definition 43 Given T and w G A* U A w , w exposes a subterm ofT in depth d if there 
is a prefix u of w such that Pj — > for some i. (Pj is the d-prefix ofT.) 
For T — >, w G A* U A u , T sinks by kj if w exposes a subterm ofT in depth 1; hence if 
T does not sink by w then Xx\ ...x m — > where X is the root nonterminal ofT. 
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Now we introduce a key ingredient, the notions of left- and right-balancing segments, with 
the right- and left-balancing pivots; we also use B for ranging over ground terms (which 
serve as balancing pivots). We assume that the underlying Q is in normal form (discussing 
this issue afterwards). 

Definition 44 A triple (T,B,v) where \v\ = M is an ^-balancing segment if T ~ Mq B, 
T — > and T does not sink by a proper prefix of v. B is called the (balancing) pivot of this 
segment, an r-pivot in this case. The £-bal-result £-Hes(T, B, v) of this segment is defined 
as follows: if T = XT[ . . . T' m , Xx x ...x m G(x 1 , x m ), and B = Pm {W u W n ) 
then £-Res{T,B,v) is the pair (G{V U . . . ,V m ), F(W U . . . ,W n )) where P^ F and 

Vi = F i (W 1 , . . . , W n ) where P* o ^ F { for % = 1,2,. ..,m = arity(X). £-Res(T,B,v) 
can be also presented as (E(Wi, . . . , W n ), F(W 1 , . . . , W n )) where E = G(Fi, . . . , F m ). 

An r-balancing segment (B,T,v), with the £-pivot B and r-HEs(B,T,v), is defined 
symmetrically. We say just "pivot" and "bal-result " when the side (i or r ) follows from 
context. 

Informally, Proposition SS] captures the following simple idea: if the "left-hand side" (lhs-) 
term does not sink by a segment of length Mo— 1, so it misses the opportunity to expose 
a depth- 1 subterm by a shortest word, then we can balance, i.e., replace its subterms (in 
depth 1 originally) by using the rhs-term (r-pivot), achieving a pair with bounded finite 
heads and the same tails, inherited from the r-pivot. By symmetry the same holds for the 
case of a non-sinking rhs-term and an £-pivot. In what follows we sometimes leave implicit 
the parts of the claims which follow by symmetry. 

Remark. The normal form assumption on Q is technically convenient (though not 
really crucial). Definition 1441 makes good sense also for w(X,i) = e (recall Proposition [151) . 
but Proposition [45] and some later reasoning would be slightly more complicated. An 
alternative to the normal form assumption would be adding a (harmless) derivation rule 
in Figure [T] enabling to replace an unexposable subterm arbitrarily. 

Proposition 45 Given an initial pair Tq,Uq, if (T,B,v) is an i-balancing segment and 
(V, W) = £-Res(T, B, v) then u (= (T, B) implies uv (= (V, W). 

Proof: Suppose the notation from Definition HH If u |= (T, B) then by Basic transition 
rule we get uv \= (G(T{, . . . , T' m ), F(W U . . . , W n )) and u(w(X,i)) \= (T^VA where Vi = 
Fi(Wi, . . . , W n ), for % = 1,2, ... ,m. Since \w(X, i)\ < M = \v\, by repeated Subterm 
replacement we get uv \= (G(Vi, . . . , V m ), F(Wx, ■ ■ ■ , W n )). □ 

Recalling Observations [20] and [TU1 we easily observe the following facts. 
Observation 46 

(1.) The bal-result of an t-balancing segment (T,B,v) is determined by the pivot B, the 
word v (of length M ) and by the root nonterminal ofT. 

(2.) The depth-size of (finite) terms F, G, F^ in the bal-result as in Definition\J^\ is bounded 
by M + B-lNC(Mo). 
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(3.) The number n of tails in the bal-result (E(Wi, . . . , W n ), F(Wi, . . . , W n )) has an upper 
bound determined by Q (since M is determined by Q). 

(I) If e-REs(T,B,v) = (V,W) = (EjW 1 ,...,W n ),F(W 1 ,...,W n )) where E = 
G(Fi, . . . , F m ) as in Definition and V — > where w exposes a subterm of V in depth 
(1 + B-Inc(Mo)) then w necessarily exposes a subterm ofV (of the form Fi(Wi, . . . , W n ) ) 
which is reachable from B by some w(X, i) (of length < M Q ). 

We now try to use the possibility of balancing along an infinite a G trac w (To), for a 
pair To ~ Uq, to show that a allows a stair-base of width n, with a function g, which 
are independent of To, Uq, a (i.e., with n and g determined just by grammar Q). We first 
observe that if there are only finitely many balancing opportunities then a allows a repeat 
(recall Definition l4"0l) and the condition is trivial: 

Definition 47 Assume an initial pair T ~ Uq and a fixed a G trac^(T ). 
For (the triple u, T, U such that) u \=(t q ,u ) (T, U), u being a prefix of a, we define the next 
^-segment as the i-balancing segment (T',B,v) for the shortest w (if there is some) such 
that uwv is a prefix of a and (T,U) — > (T',B). The distance of this next i-segment is 
defined as \w\. Similarly we define the next r-balancing segment for u \=(t ,u ) (T, U). 

Proposition 48 Given To, Uq, a as in Definition \J1[ if there is no next l-segment and no 
next r -segment for some u \=(t ,u ) (T, U), u being a prefix of a, then a has a repeat. 

Proof: Consider performing a' = u\a = 010203 . . . from T; let T — T% — ^ T2 T3 — ^> 

• - •. If there were a (first) segment T T+i ■ ■ ■ 1 T i+Mo where T is a subterm 
of T but none of T i+1 ,T i+2 , . . . ,T i+Mo is a subterm of T (of T, in fact) then we had an 
^-balancing segment. Hence each T{ is reachable from a subterm of T by a word of length 
< Mo, which means that there are only finitely many different T. Similarly for U on the 
rhs. This guarantees a repeat. □ 

Proposition 49 

Any det- first- order grammar Q in normal form has a stair-base (of some width). 

Proof: We assume a det-first-order grammar Q = (A/", A, TV) in normal form, a pair 
To ~ Uq and a fixed a G TRAC aJ (T ); we further write |= instead of \=(t ,u )- Assuming 
that a has no repeat, we show that it has a stair-base of width n, with function g, where 
n, g are independent of To, Uq, a. 

We will present a as U1V1U2V2U3V3 . . . where \vi\ = M , attaching to each Vi a triple 
(sidei,Ti,Ui) and a pair (T/, U[) such that sidei G {£, r}, (Tj, is a szdej-balancing 

segment, (T/, U-) = sidei-RES^U^Vi), (T ,U ) {T U U X ) and (T/, U-) ^4 (T i+l ,U l+1 ) 
for i — 1, 2, 3, — Hence U1V1 . . . u^iVi-iUi \= (Tj, U) and uiv± . . . UiVi \= (T/, U-). We note 
that each has the corresponding pivot B^, i.e. one of Tj, U, depending on sidei. 

{Ti,Ux,Vi) is defined as the next ^-segment or the next r-segment for e \= (Tq,U ); if 
both exist, the one with the smaller distance is chosen (recall Definition H7|) . and we prefer 
£, say, to break ties. This also induces u\ (where (T , Uq) — h (T, U{j). 
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Suppose u\Vi . . . UiVi have been defined, and assume that (T$, Ui, Vj) is an ^-segment (so 
Bi = Ui is an r-pivot; the other case is symmetrical). If for U\V\ . . . UiVi |= (T-, U-) there is 
the next ^-segment with the distance at most 

Mi = (1 + B-Inc(Mo)) • M (4) 

then we use this segment to define Ui+iVi+i etc.; there was no switch, we have side^i = 
sidei = £. If there is no such "close" ^-segment (since the i-side terms keep sinking), we 
note that a subterm of T- in depth (1 + B-Inc(M )) has been exposed by w where \w\ = Mi 
and u\V\ . . . UiViW is a prefix of a; let (7?, 17/) ^ {T",U'^). Point 4. in Observation [46] 
implies that T" is reachable from Bi, by a word arising from ViW by replacing a prefix Viw' 
with some w(X,j). Here we define (7i+i, C/i+i, Ui+i) as the next £- or r-segment with the 
least distance for wit> i . . . UiViW \= (7f, U") (so w is a prefix of Wj+i). This might, but also 
might not, mean a switch of the pivot side. 

Anyway, Sj+i is reachable from 7?j by where either Wi = fjWj+i or Wi arises from 
ViU i+ i by replacing a prefix Ujii;' of length < M + M x by a (shorter) word w(X,j). We 
thus get a pivot-path 

#1 ► # 2 > -t>3 ^ ' - - 

We note that if some Wi is longer than M + M 1; Wj = w^w" where \w^\ — M + Mi, then the 
"pivot-path sinks" in any segment of w" of length M , i.e.: for any partition w" = w'^vw^, 

\v\ — Mo, we have Bi W\ — % W2 where Wi sinks by u. 

This implies for any B\ — ^ V — > where u/3' — (3 — W1W2W3 . . . that there is a 
nonempty prefix vl of (3' of length at most 

M 2 = (M + Mi) + (1 + B-Inc(M + Mi)) • M (5) 

such that V 7?j (for some i) or sinks by vl . (Informally: any segment V —¥ of 

the pivot path B\ — % with length \w\ = M 2 either contains a pivot Bi or sinks.) Hence 
if (3 exposes subterms of Bi in all depths then infinitely many pivots Bi are equal (since 
reachable from subterms of B\ by words of length < M 2 ); Point 1. in Observation 1451 shows 
that a would then have a repeat. 

So there is the maximal depth d such that (3 = W1W2W3 . . . exposes a (unique) subterm 

u 3' 

V\ of Bi in depth d; hence B\ — > V\ — > where up = (3 and V\ does not sink by (3' . Let 
Vi = P^ o (Ti, . . . ,T n ) be the M -prefix form of Vi, and let k e N be the least such that 

B\ — % Vi Bk Bt+i • ■ • {v! being a prefix of /?'). 

Then pivots Bk+j, j = 0,1,2,..., are of the form Gj(T\, . . . , T n ) where Gj are fi- 
nite terms in which each occurrence of a variable has depth M at least. Moreover, 
Depth(Ctj) < M + B-Inc(M 2 ) ■ (j + 1) (by the above "contains a pivot or sinks" fact). 

Hence the bal-results (T£ + -, U' k+ j) for B k+ j, j = 0,1,2,... are of the form 
(Ej(Ti, . . . , T n ), Fj(Ti, . . . , T n )) where Ej, Fj are finite terms with the depth-size bounded 
by g'(j) for some g' determined by the grammar Q (recall Definition HU Point 2. in Ob- 
servation HH1 and the fact that M , M 1; M 2 , B-Inc are determined by Q). There is thus 
g : N ->■ N (independent of T , U , a) such that PresSize(J3 3 -, Fj) < g(j), for j = 0,1,2,.. .. 



18 



Point 3. in Observation |4"61 thus implies that Q has a stair-base (of some width). □ 
In fact, we have thus shown the next completeness lemma, and the main theorem. 

Lemma 50 (Completeness) 

For each det- first- order grammar Q in normal form there is a sufficient (sound) basis B. 
Theorem 51 Trace equivalence for deterministic first-order grammars is decidable. 

For deciding To ~ Uq , an algorithm based on soundness and completeness is clear (using 
the effective manipulations with graph presentations of regular terms): when we are allowed 
to generate any finite basis for a given initial pair T , U then both questions "T ^ U ?", 
"T ~ Uq ?" are semidecidable; when verifying T ~ U , we have to verify all (critical 
instances of) pairs included in the basis as well. 

Remark. By inspecting the proofs we could note that a sufficient basis for a det-first- 
order grammar (in normal form) is, in fact, computable (since we now know that the 
value s determined by Q and g(0) in the proof of Proposition 02] is computable) but this 
computability does not seem much helpful. 

Conclusions 

The presented proof of the decidability of trace equivalence for det-first-order grammars 
routinely applies to the dpda language equivalence, as also shown in Appendix 1. The 
novelty here is the presentation in the framework of first order terms, resulting in a proof 
which seems technically simpler than the previous ones. 

Appendix 2. gives another look at the complexity result by Stirling [6], showing that 
the framework of first-order terms can be useful there as well. 
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Appendix 1. 

5 DPDA language equivalence problem presented via 
trace equivalence for det-first-order grammars 

A deterministic pushdown automaton (dpda) is a tuple M = (Q,A,T,A) consisting of 
finite sets Q of (control) states, A of actions (or terminals), T of stack symbols, and A of 
transition rules. For each pair pA, p G Q, A G T, and each a G A U {e}, A contains at 
most one rule of the type pA — > qa, where q G Q, a G T*. Moreover, any pair pA is 
(exclusively) either stable, i.e. having no rule pA — qa, or unstable, in which case there 
is (one rule pA — qa and) no rule pA — ga with a G „4. 

A dpda M generates a labelled transition system (Q x T*, ^4U {^j, { — >} a eAu{e}) where 
the states are configurations qa (q G Q, a G T*). Having our grammars in mind, we view 
a rule pA — > qa as pAx — > qax (for a formal variable x), inducing pAf3 — > qaf3 for 
every j3 G T*. The transition relation is extended to words w G A* as usual; we note that 
pa g/3 can comprise more than \w\ basic steps, due to possible "silent" e- moves. Each 
configuration pa has its associated language L(pa) = {w G A* \ pa — > qe for some q}. 
The dpda language equivalence problem is: given a dpda M and two configurations pa, q(3, 
is L(pa) = L(qf3) ? 

Remark. It is straightforward to observe that this setting is equivalent to the classical 
problem of language equivalence between deterministic pushdown automata with accepting 
states. First, the disjoint union of two dpda's is a dpda. Second, for languages Li,L 2 C S* 
we have L\ = L 2 iff L\ ■ {$} = L 2 ■ {$}, for an endmarker $ ^ E; so restricting to prefix-free 
deterministic context-free languages, accepted by dpda via empty stack, does not mean 
losing generality. 

Each dpda M can be transformed by a standard polynomial-time algorithm so that 
all ^-transitions are popping, i.e., of the type pA — > q, while L(pAa), for stable pA, 
keep unchanged. (A principal point is that a rule pA — qBa where qB qi(3\, . . ., 
qB can be replaced with rules pA — ^ qjfija; unstable pairs pA enabling only an 

infinite sequence of e-steps are determined and removed.) 

It is also harmless to assume that for each stable pA and each a £ A we have one 
rule pA qa (since we can introduce a 'dead' state q d with rules q^A — % q d A for all 
A G T, a G A, and for every 'missing' rule pA — —> .. we add pA — —> qdA). L(pa) are 
unchanged by this transformation. Then w G A* is not enabled in pa iff w = -uf where 
pa — > qe (for some q), so w G L(pa), and v ^ e. This reduces language equivalence to 
trace equivalence: 
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L(pa) = L(q/3) iff Ww G A* : pa — > qf3 — >. 

Proposition 52 The dpda language equivalence problem is polynomial-time reducible to 
the deterministic first-order grammar equivalence problem. 

Proof: Assume an (e-popping) dpda M = (Q,A,T,A) transformed as above (so trace 
equivalence coincides with language equivalence). We define the first-order grammar Qm = 
(J\f,A,TZ) where Af = {pA \ pA is stable} U {J-}; each X = pA gets arity m = \Q\, and 
1 is a special miliary nonterminal not enabling any action. A dpda configuration pa 
is transformed to the term T{pa) defined inductively by rules l.,2.,3. below, assuming 

Q = {<n,q2,---,q m }- 

1. T{qe) = ±. 

2. If qA q { (qA is unstable) then T(qA/3) = T(ft£). 

3. If qA is stable then T(qA/3) = XT(qi0) ■ ■ .T{q m fi) where X = qA. 

4. T{qiX) = Xi. 

Rule 4. is introduced to enable the smooth transformation of a dpda rule pA — — >■ qa, 
where a G A, rewritten in the form pAx — > qax, to the QM-^ule T(pAx) — > T(qax), i.e. 
to Xx\ . . . x m — > T(qax), where X = pA. Thus 71 in Qm is defined (with no e-moves). 
We observe easily: if pAa qa (recall that e-steps are popping) then T(pAa) = T(qa); 
if pAa — > q(3a (a G A, pA stable) then T{pAa) — % T(q(3a). This also implies: if 
pa — > qe then T{pa) — > _L. Thus 

L(pa) = L(qf3) iff (Vu> G A* : pa <^ q/3 ) iff T(pa) ~ T(g/3). 

We note that T(qa) can have (at most) 1 + m + m 2 + m 3 + • • • + m' a ' subterm-occurrences, 
but the natural finite graph presentation of T{qa) has at most 1 + m(|a| — 1) + 1 nodes 
and can be obviously constructed in polynomial time. □ 

Appendix 2. 

6 A complexity bound 

We have, in fact, not fully used the potential of the pivot-path B\ — ^ B 2 — ^» B 3 —h ■ ■ ■ 
discussed in the proof of Proposition H9j For showing the existence of a sufficient basis it 
was sufficient to use just the stair-base subterm V\ of B\. We will now explore the idea 
of the described balancing strategy further, which allows to derive a concrete computable 
function bounding the length of potential offending words (i.e., the eq- level) when given 
(a det-first-order grammar Q and) an initial pair (T , Uq). 

For our aims (in the context of upper complexity bounds), an elementary function is 
a function (of the type M fc — > N) arising by a finite composition of constant functions, 
the elementary operations +, — , x, , and the exponential operator f, where a n = a n . 
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When we say that a number is simply bounded, we mean that there is an elementary function 
of the size of Q giving an upper bound; e.g., the constants M , M x , M 2 , the number of tails 
in the Mo-prefix form, the depth-size of the heads in any balancing result, etc., are obviously 
simply bounded. 

The "first" nonelementary (hyper)operator is iterated exponentiation yf, also called 
tetration: a ft" n = a f (a f (a f (. . . a f a) . . .)) where f is used n-times. 

Our analysis will yield the following bound on the length of offending words, which has 
an obvious algorithmic consequence: 

Theorem 53 For any triple Q, T ,Uq with the size InSize (of a standard presentation), 
where Q is a det- first- order grammar and (T , Uq) is an initial pair such that T ^ JJq, there 
is a sequence of actions which is enabled in just one ofT , Uq and its length is bounded by 
2 j"f /(InSize), where f is an elementary function independent of Q , T ,Uq. 

Corollary 54 Trace equivalence for deterministic first-order grammars can be decided in 
time (and space) 0(2 ff ^(InSize)) for an elementary function g. 

The analogous claims hold for language equivalence of deterministic pushdown automata, 
as follows from the reduction in the previous section. 

We now aim to prove Theorem [53j In the rest of this section we assume a fixed det-first- 
order grammar Q = (A/", A, TV) in normal form, if not said otherwise. 

Later we will consider a fixed initial pair (T , U ), where T ^ U , and a fixed offending 
word a G A* for (To,Uq). The word a is (now) finite, and our aim is to show an ap- 
propriate upper bound on its length which will prove Theorem [531 To this aim, we first 
show an "upper-bound tool" (Proposition [56] with Corollary [571) . and then we demonstrate 
that the balancing strategy along our finite a (the same strategy as used in the proof of 
Proposition H9] along the infinite a there) guarantees that the upper-bound tool can be 
applied to bound the length of a. 

We start with noting a possible new type of sound subterm replacement; roughly speak- 
ing, if a pair of heads repeats then an equation (if some is exposable) is available for a 
potential application. (This new subterm replacement serves just for our reasoning, we will 
not extend the definition of |=.) In the next proposition, it might help to imagine that we 
have u \= [E(U X ...U n ), F(U X . . . U n )) and v |= (Eiy 1 ...V n ), F{Vy . . . V n )) where \u\ < \v\ 
and we would like to label v also with (E(V[ . . . VJQ, F(V{ . . . V£)), where V- arises from Vj 
by possible replacing of some occurrences of U n with H hmn (Ui, . . . , Z7 n _i). 

Proposition 55 Assume a pair (E(xi, ... ,x n ), F(x x , ... ,x n )) (of regular terms) for which 
there is a shortest w £ A* exposing an equation, w.l.o.g. say x n = H(xi, . . . , x n ) (H ^ x n ). 
Let us consider the following three pairs 

[E(U X ...U n ), F(Ut . . . U n )), [E{V X ...V n ), F{V X . . . V n )), (E(V{ ...V^), F(V{ . . . V n )), 

with eq-levels k x , k 2 , k 3 , respectively, where Vj = Gj(U n ) and Vj = Gj(H hnin (Ui, . . . , U n _ x )) 
(for j = 1, 2, . . . , n). Then k 3 > min{A;i, k 2 }, and k 3 = k 2 if k x > ki- 
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Proof: We note that U n ~ fc H(U x ,...,U n ) ~ fc H limn (U x , . . . , U n _ x ) where k = 
max{ki — \w\, 0}; hence Vj ~ k V- for j = 1, 2, . . . , n. 

Suppose now k% < min{/ci, ^2}- Then there is an offending prefix w' for the third pair 
which exposes an equation for E,F (recall Proposition 1281) ; necessarily \w'\ > \w\. We 

thus have (E(V{, . . . , V£), F(V{, . . . ,V£)) {V(, G(V{, V£) (or vice versa) for some i 
and G, where EqLv(\//, G{V{, . . . , V§) = k 3 ~ \w'\ = k'. But EqLv(^, G(V x , V n )) > 
k 2 — \w'\ > k'+l and Vi ~fc'+i V( (since k = k x —\w\ > k'), which yields a contradiction. 
Similarly we would contradict the case ki < min{fci, k^}. Hence the claim follows. □ 

A simple corollary is that if u \= (E(T), F(T)) and v \= (E(e(T)),F(e(T))), for a 
regular term e = e(x\) (a "1-tail extension"), where \u\ < \v\, then if v is an of- 
fending prefix for (T , U ) then EqLv (E (e(T)), F(e(T))) is independent of T (since 
EQLv(£(e(T)),F(e(T))) = EQLv(E(e(H lim ^)), F{e{H Um ^))) for the appropriate H). 
Hence if we also have v! (= (E(T'),F(T')) and v' \= (E(e(T')), F(e(T'))), where \u'\ < 
\v'\ < \v\ then it is impossible that both v,v' are offending prefixes for (T , Uq). 

We now show a generalization, which seems related to the Subwords Lemma in [1]. 
We use a visually more convenient "two-dimensional" notation for (composed) terms: the 
first rectangle below is a shorthand for Eo\Oi - ■ ■ o r where aj = [e{/x\, . . . , en/x n ]; it 
also presupposes that the variables occuring in all terms in the rectangle are from the set 
{x\, . . . , x n } . 

Given a (head) pair (E(xi, . . . , x n ), F(xi, . . . , x n )), 



and n tuples, called head extensions, (e\, . . . , e*), 



(e?,...,e£), (e",...,e^), where E,F and e) = 


E 


F 


e l j(xi, . . . , x n ) are regular terms, we call (E', F') an ex- 


e l • • • e n 


ph ph 


tended head pair if it can be presented as depicted, for 


pi-2 ph 


p*2 p«2 


< r < n and 1 < i x < i 2 < • • • < i r < n. We note 






that there are 2 n such presentations. By (E' max , F^^), 


e l • • • e n 


p*>- pV 
e l • • • e n 



called the maximal pair, we denote the pair with r = n 



and i x = 1, — 2, . . . , i n = n. 

Proposition 56 ylsswme . . . , x n ), F(x x , . . . , x n )), (e\, . . . , e l n ) , (e\, . . . , e 2 n ) , 

(e™, . . . , e") as above, and consider a tuple T x , . . . ,T n (of regular ground terms). 
Ifk = EQLv(E^ ax (T 1 , . . . , T n ), i^(Ti, . . . , T n )) is less than 

EqLv(£"(Ti, . . . , T n ), F'(Ti, . . . , T n )) for any other extended head pair (E', F') then k is 
independent of T x , . . . ,T n . 

Proof: The claim is trivial for n = 0. We prove it for n > 0, assuming it holds for n— 1. 
If there is no w G A* exposing an equation for E, F then the claim is trivial since 
EqLv(E(U x , ...,U n ), F(U X , U n )) is independent of U x , . . . , U n (for any U x ,...,U n ). So 
we assume a shortest w G A* exposing an equation for E, F, w.l.o.g. x n = H(x x , . . . , x n )- 
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,T n ) where E',F' 
1 gives rise to the 



Each pair E'(T U . . . , T n ) } F'(T h . 
is an extended head pair with i\ 
depicted pair, by replacing each e/ ( ) with 

e[ (x\, . . . , = e e J [H hmn /x n } and by omitting the 

now superfluous T n and e« (for ij ^ 1). This procedure 
is independent of trees 7\, . . . , T n ; these are handled as 
"black boxes" . 

We thus get 2 n_1 pairs, with the head pair (E, F) = (E(e\ . . . e n ), F(e\ . . . e n )) and head 
extensions (ef, . . . , e^_i), (ef, . . . , e„_i), . . ., (e", . . . , e^_i). Repeated use of Proposition 




for the triples of the form 



E 



1 



Ti...T n 



1 



T x ...T n 




E 



ei 1 ... el 



ie-i 

■<-n 



1 • 

e l . . . e n 

i e+1 i t+1 



T x ...T n 




nc 



guarantees that the maximal pair (in the new 2™ _1 pairs) has the same eq- level as t 
original maximal pair (in the originally assumed 2 n pairs), and this eq- level is less than 
the eq-level of any other pair. We can thus apply the induction hypothesis. □ 

For stating an important corollary we introduce the following notion (which assumes 
a fixed triple Q,T ,U ). Given a head pair E(x\, . . . , x n ), F(xi, . . . , x n ) and head ex- 



tensions (e|,...,e n ), (. 
{EmaxyTi, . . . , T n ), F max (Ti, . 
head pair (E', F') there is u, 



e"), we say that, for a tuple Ti,...,T n , the pair 
. , T n )) is saturated on level meNif for each other extended 
\u\ < m, such that u |= (E'(Ti, 



,T n ),F'(T h ...,T n )). 



Corollary 57 If u \= (U,U') for an offending prefix u for (Tq,Uq) where (U,U r ) can be 
presented as a pair (E' max (Ti, . . . ,T n ), F^^iTi, . . . ,T n )) saturated on level \u\ then there 
cannot exist an offending prefix v, \v\ ^ \u\, and a tuple T[, . . . such that v \= (V, V) 
where (V, V) can be presented as (E' max (T{, . . . , T^), F^ nax {T[, . . . , TJQ) saturated on level 
\v\ (where the head pair E,F and the head extensions are the same in both cases). 

We now fix T ,U , where T ^ JJ , and a (finite) a G OW(T , f/ ), and use the same 
(balancing) strategy along a as we used along the infinite a in the proof of Proposition HH1 
i.e., we present a in the appropriate form uiViU 2 v 2 ... as long as possible. We note that 
we are guaranteed that a does not allow a repeat, since it is offending; if u |= (T, U) for a 
prefix u of a then EqLv(T, U) = EqLv(T , Uo) — \u\. Instead of an infinite pivot path we 
now get a finite pivot path 



Bo 



Ul 2l 
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maybe with k = (i.e., with no balancing step at all). But recall the pivot-path property: 
each segment V — > of length \w\ = M 2 either contains a pivot, i.e. V — > Bi for a prefix 
u of w, or sinks, i.e. V sinks by w. 

Each pivot Bi, for i — 1,2, ... , k—1, has the unique subterm V{ (maybe Vi = Bi) which 
is exposed by a proper prefix of Wi but none of its proper subterms is thus exposed. This 
yields the path 



Bl ^ Vi ^ B 2 ^ V 2 ^ B 3 



— > Bk-i — )• Vfc_i — > Bk 



(6) 



where wn can be empty but are nonempty and \wii\ < Mi. We note that for each 
segment Vi B i+ i -^f 1 V^+i we either have that V, does not sink by Wi2Wi + i i i or Vi + ± is 
a subterm of Vi. 

Definition 58 We call a subsequence (io, ii, . . . ,i r ) of the sequence (1, 2, ... , k—1) a stair 
sequence if for each j G {0, 1, . . . , r— 1} we have that Vi- does not sink by w where Vi 3 — >■ 
V ij+1 is the appropriate segment of (TJj, so w = w^^ 2 )W ij+1 . . . 

A stair sequence is maximal if it is not a proper subsequence of any other stair sequence. 



Proposition 59 // (i , i 
be presented as 



i- 



is a maximal stair sequence then V^, V ii: Vi 



Vi can 



V 



■Hi 



Gq 

T x ...T n 



Gi 



e\ . . . e\ 
Tt...T n 



V, 



G 2 
e\...e 



e\ . . . e\ 
Ti...T n 



V, 



where Gj are Mo-prefixes (so n is simply bounded) and DEPTH(e*) < B-Inc(M2 




Proof: Assuming a maximal stair sequence, we first show that V^. +1 is a subterm of Vi. + \\ 
For any i > 1 such that ij+£ < ij + i we must have that Vi j+ £ sinks by w' for the appropriate 



segment Vi 



+£ 



V 



J+l' 



which implies that there is £' such that ij + £ < ij + £' < ij + i 



where Vi i+ e> is a subterm of V. 



ij+e 



By definition, for the segment Vi . 



Vi j+1 we have Yx\ . . . x„ 



F(x h 



, Xr 



where 



Y is the root nonterminal of Vi . and F is not a variable. Recalling the above pivot-path 
property and the fact that V ij+1 is a subterm of V ij+ i, we deduce that 1 < Depth(F) < 
1 + B-Inc(M2). This easily implies the claim. (Note that A can be just a variable. It is also 
possible that some Tj, e* get obsolete, do not really matter in the respective substitutions.) 

□ 



Proposition 60 There is an elementary function g, independent of Q,To,Uo,a, such that 
\oi\ < ^(InSize, €) where i is the length of the longest stair sequence. 



25 



Proof: For any ground term V which can be presented as V = F(W\, . . . , W m ) where F is 
a finite term and Wi are subterms of T or Uq, let us define size(V) as the least Depth(T) 
in such presentations; we note that if V is a subterm of V then size(V) < sizeiV). Since 
either To — 1 -+ B\ or Uq — ^ the above szze is well defined for all and Vi ; moreover, 
size(Vj+i) < sizeiVi) + M 3 where we put M 3 = 1 + B-Inc(M 2 ). 

We now show that if sizeiVi) > pM 3 (for p G N) then there is a stair sequence 
(io,h, ■ ■ ■ ,i p ) such that i p — i: Suppose Vi is a counterexample for the least i and some 
p; we necessarily have p > 1. Since Vf is a subterm of B\ and -Bi is reachable from a 
subterm of To or Uq by less than Mq moves, we surely have size(Vi) < M3; hence i > 1. 
We have size(Vi_i) > (p— 1)M 3 and is a subterm of VJ_i (since V£_i satisfies the claim 
for p — 1 and thus V^_i — )■ necessarily sinks); hence s2ze(Vi_i) > pM 3 . Then z— 1 > 1 
and we also have sizeiV^) > (p— 1)M 3 and Vi is a subterm of Vi_ 2 , so sizeiV^) > pM 3 ; 
continuing this reasoning leads to a contradiction. 

Hence if £ is the length of the longest stair sequence then we have size(Bi) < (£ + 2)M 3 
for each pivot Bi, which gives an elementary bound (in InSize and £) on the length of the 
pivot path, and thus an elementary bound on \a\ as well. □ 

Thus to prove Theorem [53J it is sufficient to show Proposition EU We show this by using 
Corollary [57] and a few combinatorial facts. (The combinatorial facts could be surely 
found in the literature in some form, e.g. for so called Zimin words, but we provide short 
self-contained versions tailored to our aims.) 

Definition 61 Given an alphabet S ; the empty word e is of type 0; for n > 1, a word 
w G X* is of type n if w = vuv for some v of type n—1 and some u, \u\ > 1. Each word 
w of type n > has a type-n presentation given by nonempty words t>i,t>2, • • • ,v n ; these 
give rise to words Wi, w 2 , ■ ■ ■ , w n , where Wi is of type i, as follows: W\ = v 1} w 2 = WiV 2 Wi 
(= v 1 v 2 v 1 ), w 3 = w 2 v 3 w 2 (= v x v 2 v x v 3 v x v 2 v x ) , w n = w n _iv n w n ^i, where w n = w. 

Proposition 62 For a type-(n+l) presentation of w G S* ; given by words v 1, v 2 , . . . , v n+ i, 
there are words U\,u 2 , . . . , u n , all beginning with the first symbol ofv\, such that u^u^ . . . u ir 
is a suffix of (the right quotient) w/v\ for all 1 < r < n and 1 < %\ < % 2 < • • • < i r < n. 

Proof: The words v 1, v 2 , . . . , t> n+ i give rise to Wi,w 2 , . . . , w n+ i as in Definition [6TJ We 
denote w\ = Wi/vi, and put Ui = ViV^iW^ for i = 1, 2, . . . , n (so w' i+l = w\ui = w^ViVi+iw'j). 
Hence 

= e Ui = V\v 2 

W 2 = V X V 2 U 2 = VxV 3 ViV 2 

W' 3 = fiV2fif 3 fif2 U 3 ^xV^ViV^xV^^ 

Since is a suffix of w'j for each j > i, it is sufficient to show that u^Ui 2 . . . Ui r is a suffix of 
w' ir+l . Since w' ir+1 = w' ir Ui r and by induction hypothesis we can assume that u^Ui 2 . . . Ui r _ x 
is a suffix of w[ and thus of w' ir , the claim is clear. □ 

For h G N, we define function fh : N — > N (recursively) and note the next proposition: 

A(0) = 0, f h ( n + l) = (l + f h (n))-h^ n K 
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Proposition 63 If |E| = h then each w G S* ; > fh(n), contains a subword of type n. 

Proof: By induction, using the pigeonhole principle. For n = the claim is obvious. 
Any word w of length fh(n + 1) necessarily contains two occurrences of some (sub) word 
u of length fh(n) separated by a nonempty word. Thus w = Viuv 2 uv 3 where |t> 2 | > 1 and 
u = U1U2U3 with U2 of type n; this means that the subword U2U3V2U1U2 is of type n + 1. □ 



Proposition 64 There is an elementary function g , independent of Q } Tq } Uq } a, such that 
the length of any stair sequence has an upper bound 2 g(size(Q)) . 



Proof: Given a maximal stair sequence (io, ii, ■ ■ ■ , i r ) and the presentation of 
^<o> Via ^i 2 ; • • • 1 Vi r as in Proposition [591 let us consider the pivot (sub)sequence 

The balancing results with 



B 
B 



io+ i, . . . , B ir _ 1+t (B ij+1 is the first pivot after 
i 0+ i, . . . , B ir _ 1+ i can be presented as 



E 
T x ...T n 



F 
T x ...T n 



e 1 . . . e n 
Tt...T n 



e l • • • e n 

T x ...T n 




F r - 



er 1 



e l • • • e n 

T x ...T n 



where Ei, F{, e\ . . . e l n are finite terms whose depth-size is simply bounded (unlike 
in/around Proposition 1561 where we considered general regular terms). Hence the tuples 
(Ei, Fi,e\, . . . , e l n ) can be viewed as elements of an alphabet with h elements, where h is 
bounded by an elementary function of size(Q). If r— 1 > fh(n+2) then the word 



(E, 



r-li 



-r-1 



r—l-i c i 



r-V 



(E r -2, F r _ 2 , e\ 2 , 



e- 2 ) ... (E^F^el, 



contains two different occurrences of a subword of type n+1, by Proposition By Propo- 
sition [62j from this subword we can extract a pair E, F (i.e., the "head-projection" of 
the first symbol of v±), and n words ("extension-projections" of u x , . . . ,u n ) (e\, . . . ,e^), 
(e 2 , . . . , e 2 ), . . ., (e", . . . , e"), where each (e{, . . . , e 3 n ) corresponds to the substitution aris- 



ing by composing the substitutions corresponding to a segment ef . . . so that: 

there are U\, U2 ■ ■ ■ , U n determined by the first occurrence of the type n+1 subword such 




U ) F' 

^n)i ± max 




U n )) is sat- 



that for each tuple 1 < i x < 12 < ■ ■ ■ < i r < n there is a pair 
in the above sequence, where the appropriate (E' max (Ui, 
urated on the corresponding level. Similarly for Vi, V2, . . . , V n determined by the second 
occurrence of the type n+1 subword. This would yield a contradiction with Corollary [571 
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Therefore r — 1 < f h (n + 2). From the definition of fh we can easily derive fh(n + 2) < 
h tt 9i( n ) f° r an elementary function g 1 . Since h and n are bounded by elementary 
functions of size(Q), the claim follows. □ 
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